Mining Frequent Partite Episodes with Partwise Constraints
نویسندگان
چکیده
In this paper, we study the problem of efficiently mining frequent partite episodes that satisfy partwise constraints from an input event sequence. Through our constraints, we can extract episodes related to events and their precedent-subsequent relations, on which we focus, in a short time. This improves the efficiency of data mining using trial and error processes. A partite episode of length k is of the form P = ⟨P1, . . . , Pk⟩ for sets Pi (1 ≤ i ≤ k) of events. We call Pi a part of P for every 1 ≤ i ≤ k. We introduce the partwise constraints for partite episodes P , which consists of shape and pattern constraints. A shape constraint specifies the size of each part of P and the length of P . A pattern constraint specifies subsets of each part of P . We then present a backtracking algorithm that finds all of the frequent partite episodes satisfying a partwise constraint from an input event sequence. By theoretical analysis, we show that the algorithm runs in output polynomial time and polynomial space for the total input size. In the experiment, we show that our proposed algorithm is much faster than existing algorithms for mining partite episodes on artificial datasets.
منابع مشابه
Mining Frequent k-Partite Episodes from Event Sequences
In this paper, we introduce the class of k-partite episodes, which are time-series patterns of the form 〈A1, . . . , Ak〉 for sets Ai (1 ≤ i ≤ k) of events meaning that, in an input event sequence, every event of Ai is followed by every event of Ai+1 for every 1 ≤ i < k. Then, we present a backtracking algorithm Kpar and its modification Kpar2 that find all of the frequent k-partite episodes fro...
متن کاملInferring Neuronal Network Connectivity using Time-constrained Episodes
Discovering frequent episodes in event sequences is an interesting data mining task. In this paper, we argue that this framework is very effective for analyzing multi-neuronal spike train data. Analyzing spike train data is an important problem in neuroscience though there are no data mining approaches reported for this. Motivated by this application, we introduce different temporal constraints...
متن کاملInferring neuronal network connectivity from spike data: A temporal data mining approach
Understanding the functioning of a neural system in terms of its underlying circuitry is an important problem in neuroscience. Recent developments in electrophysiology and imaging allow one to simultaneously record activities of hundreds of neurons. Inferring the underlying neuronal connectivity patterns from such multi-neuronal spike train data streams is a challenging statistical and computat...
متن کاملAn Efficient One-pass Method for Discovering Bases of Recently Frequent Episodes over Online Data Streams
The knowledge embedded in an online data stream is likely to change over time due to the dynamic evolution of the stream. Consequently, in frequent episode mining over an online stream, frequent episodes should be adaptively extracted from recently generated stream segments instead of the whole stream. However, almost all existing frequent episode mining approaches find episodes frequently occu...
متن کاملFuzzy Frequent Episodes for Real-time Intrusion Detection
Data mining methods including association rule mining and frequent episode mining have been applied to the intrusion detection problem. In other work, we have introduced modifications of these methods that mine fuzzy association rules and fuzzy frequent episodes and have described off-line methods that utilize these fuzzy methods for anomaly detection from audit data. In this paper we describe ...
متن کامل